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Modularity maximization is the most popular technique for the detection of community structure 
in graphs. The resolution limit of the method is supposedly solvable with the introduction of 
modified versions of the measure, with tunable resolution parameters. We show that multiresolution 
modularity suffers from two opposite coexisting problems: the tendency to merge small subgraphs, 
which dominates when the resolution is low; the tendency to split large subgraphs, which dominates 
when the resolution is high. In benchmark networks with heterogeneous distributions of cluster 
sizes, the simultaneous elimination of both biases is not possible and multiresolution modularity 
is not capable to recover the planted community structure, not even when it is pronounced and 
easily detectable by other methods, for any value of the resolution parameter. This holds for other 
multiresolution techniques and it is likely to be a general problem of methods based on global 
optimization. 
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I. INTRODUCTION 

The detection and analysis of communities in 
graphs [IJ [2] is one of the most popular topics within the 
modern science of networks [3HTU]. In the latest years 
an increasing number of large networked datasets includ- 
ing millions or even billions of vertices and edges have 
become available, and a traditional analysis based on lo- 
cal network properties and their global statistics (e.g., 
degree distributions and the like) provides but a partial 
description of the system and its function. Communities 
(also called clusters or modules) are subgraphs including 
vertices with similar features or function, and their iden- 
tification may disclose not only such similarities among 
vertices, which are often hidden, but also how the system 
is internally organized and works. 

Vertices belonging to the same community have a con- 
siderably higher probability of being linked to each other 
than vertices belonging to different clusters. Therefore 
a community appears as a region of the network with a 
high density of internal links, much higher than the aver- 
age link density of the graph. The most popular method 
to detect communities in graphs consists in the optimiza- 
tion of a quality function, the modularity introduced by 
Newman and Girvan [TTJ [12] . Modularity quantifies the 
deviation of the internal link density of the clusters from 
the density one expects to find within the same groups 
of vertices in random graphs with the same expected de- 
gree sequence of the network at study. The idea is that 
vertices linked to each other in a random way should not 
form communities, as high values of the link density can- 
not be attained. Consequently, high values of modularity 
are supposed to indicate "suspiciously" high values of in- 
ternal link densities for the subgraphs, which are then 
distinct from groups of randomly linked vertices and can 
be deemed as true communities. While this is actually 



not true p~3j [14], the optimization of the measure has 
been widely used in the past years. 

Recently it has been pointed out that modularity opti- 
mization has a number of problems. In particular, it has 
a resolution limit [15] , that leads to the systematic merger 
of small clusters in larger modules, even when the clus- 
ters are well defined and loosely connected to each other. 
A more recent analysis of the resolution limit has led to 
the conclusion that the modularity landscape is "glassy" , 
and includes an exponentially growing (with system size) 
number of local maxima whose values are very close to 
the absolute maximum of the measure, even if the corre- 
sponding partitions may be topologically quite different 
from each other [16]. This implies on the one hand that 
it is not too difficult to find a good approximation of the 
modularity maximum for many techniques, on the other 
hand that the maximum is essentially unreachable. A 
recent comparative analysis of community finding algo- 
rithms has indeed revealed that modularity fails to prop- 
erly identify clusters on benchmark graphs with built-in 
community structure, and that other methods are much 
more effective [T7j . 

Nevertheless, modularity optimization is still being 
used. The main reason is the claim that the resolution 
limit can be removed by adopting suitable multiresolu- 
tion versions of modularity, like those introduced by Re- 
ichardt and Bornholdt [18] and by Arenas, Fernandez 
and Gomez [19]. In these variations, a tunable resolu- 
tion parameter enables one to set the size of the clusters 
to arbitrary values, from very large to very small. How- 
ever, real networks are characterized by the coexistence 
of clusters of very different sizes, whose distributions are 
quite well described by power laws [20-22 . Therefore 
there is no characteristic cluster size and tuning a reso- 
lution parameter may not help. Indeed, in this paper we 
show that multiresolution modularity is not capable to 



identify the right partition of the network in realistic set- 
tings and that therefore it does not solve the problems of 
modularity maximization in practical applications. The 
problem is that modularity maximization is not only in- 
clined to merge small clusters, but also to break large 
clusters, and it seems basically impossible to avoid both 
biases simultaneously. This applies to other multiresolu- 
tion methods as well and is probably a general feature of 
methods based on the optimization of a global measure. 
The paper is structured as follows. In Section [IT] we 
present a general analysis of some relevant mathematical 
properties of multiresolution modularity, with respect to 
the merger or split of subgraphs, leading to the iden- 
tification of a range of values of the resolution param- 
eter where modularity should be safe from the above- 



mentioned problems. In Section III we test the result on 
realistic benchmark graphs with community structure, 
showing that it is often impossible to find a value of the 
resolution parameter that delivers the planted partition. 



Conclusions are reported in Section IV 



II. 



THE PROBLEM OF MERGING AND 
SPLITTING CLUSTERS 



B 




Figure 1: (Color online) Schematic representation of the prob- 
lem of merging versus splitting subgraphs. Here A and B are 
two subgraphs, the problem is whether one yields a higher 
value for modularity by merging them in a single subgraph or 
by keeping them separated. The parameters involved in the 
decision are the number of internal links in A and B (mul- 
tiplied by 2), k^ n and kf n , the number of links v between A 
and B (here v — 3), the number of links / between A and 
vertices belonging neither to A nor to B (here / = 4), and its 
equivalent r for B (here r — 2). 



A. Multiresolution modularity 



Our conclusions are not significantly affected by the 
specific modularity formula one chooses, as we will show 
in Section |III| For the analytical discussion of this Sec- 
tion we adopt the generalized modularity Q\ proposed 
by Reichardt and Bornholdt [18] . which reads 



Q* = £ 
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where the sum runs over all the clusters, 2M is the to- 
tal degree of the network, k^ ot is the sum of the degrees 
of vertices in module S and kf n is twice the number of 
internal edges in module S. So, we have kf ot = kf n only 
if the module is disconnected from the rest of the graph. 
Here A works like a resolution parameter: high values of 
A lead to smaller modules because the term (k^ ot /2M) 2 
in the sum of Eq. (IT]) becomes more important and its 
minimization, induced by the maximization of Q\, favors 
smaller clusters. 

We ask when it is proficuous for modular- 
ity to keep two subgraphs together or separate. 
For this, we need to compute the difference 
AQ\ = Q\ (partition with merged subgraphs) — 
Qx (partition with separated subgraphs): if AQa > 
modularity would be higher for the partition where the 
subgraphs are merged, otherwise the split would be 
more convenient. 

We indicate with A and B the two subgraphs (see 
Fig. [I}. Let Q^~ B and Q$ UB denote the value of mod- 
ularity when A and B are kept separated and merged, 



respectively. 
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where v denotes the number of links joining A with £?, / 
the number of links joining A with the rest of the network 
(excluding B) and r is the equivalent of / for B. For 



~)x UB we have: 
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The difference AQx 
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To simplify a little Eq. Q we can define A = 2MAQ> 

A 



M 

Mkf n + kg + l + r)+v 2 

X M • (5) 



Modularity is higher for A and B merged if and only if 
A > 0. 

Eq. ([5| is rather general but we are just interested 
in testing modularity for some special cases, for which 
calculations are easy. Here in particular, we will consider 
the case I = r = rj and k^ n = kf n = £. Eq. (|5| becomes 



A = 2v - A 



(t + v + v) 2 
M 



(6) 



These results are essential to follow the discussion of 
the next subsections. 



B. Splitting clusters 

Despite the different approaches to the problem of de- 
tecting clusters in networks, there are some general ideas 
which are shared by most scholars. One of them is that 
a random graph has no communities, so it should not 
be split by an algorithm in smaller pieces, with the only 
exception of the trivial split in singletons, i.e. in groups 
containing each just a single vertex, which is still an ac- 
ceptable answer. Another shared belief is that a complete 
graph (or clique), i.e. a graph whose vertices are all con- 
nected to each other, is a perfect community (due to the 
fact that the internal link density reaches the highest pos- 
sible value of 1). So, if cliques are just loosely connected 
to each other, one would expect that a good method 
should detect them as separate clusters. We would like 
to find the mathematical conditions, in particular the 
choice of the resolution parameter A, that satisfy both 
requisites. In this subsection we search for the condition 
to avoid the splitting of random subgraphs, while the 
condition to avoid the merger of cliques will be given in 
the next subsection. 

Let us consider a random subgraph S with total degree 
2Ms, which is part of a larger network with total degree 
2M . The goal is to check under which condition S is split 
by optimizing modularity. Here for simplicity we consider 
only bi-partitions. The expected optimal modularity Q2 
for the bipartition of a random graph has been computed 
by Reichardt and Bornholdt [23 



Q RB = 0.765 






(7) 



where the brackets indicate expectation values over the 
ensemble of random graphs with the same expected de- 
gree sequence of the subgraph at study. 

We now express Q2 in terms of the number of edges 
v between the clusters of the bipartition with optimal 
modularity. We obtain 



2M S Q 2 = 2M S - 2v 



k\ 



k\ 



2k A k 



A^B 



2M S 



2M S 



2v, (8) 



where k A (&b) is the total degree of module A (B). Since 
modularity is optimal when the two modules are of about 



equal size, i.e. when k A ~ ks ~ Ms, we have: 

2M S Q 2 =M S - 2v, (9) 

from which we can derive v, 

v = M s {^-Q 2 ). (10) 

For Q 2 = we would have v = Ms/2, which is the 
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Figure 2: (Color online) The plot shows as measured on 
Erdos-Renyi and scale free graphs. For each type of graph 
we plot the analytical estimate of Reichardt and Bornholdt 
(RB) and a numerical estimate obtained by optimizing mod- 
ularity with simulated annealing (SA) [13]. The minimum 
cut v = as x Ms was measured by optimizing modularity for 
different values of A over the set of bipartitions. To optimize 
modularity, we are looking for small values of v and equal 
values of k A and k B , so tuning A just controls the importance 
of either requirement. However, simulations show that the 
dependence on A is quite weak, validating our approximation 
k A ^k B . 



expected average number of links joining two modules 
of equal size, arbitrarily chosen. Eq. (10) implies that 



optimizing modularity decreases the number of expected 
links between the modules, with respect to arbitrary bi- 
partitions, while it increases the internal density of links 
of the modules. One also sees that, for v to be positive, 
Q2 < 0.5. Actually, in the calculation of Reichardt and 
Bornholdt, this holds only if (k) is big enough. To give 
an idea of the numbers that one could have, Q2 ~ 0.17 
when all vertices have degree k = 20, so v ~ 0.33 x Ms 
which is actually a not too bad approximation also for 
other degree distributions (for all vertices having degree 
k = 10, v ~ 0.25 x Ms)- Let us call as this proportion- 
ality factor between v and M5, 



v = asMs and kf n = kf n = (1 
From Eqs. Q, ([l0| and ([IT) we get 

{Vk), 



as 



0.765 
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a s )M s . (11) 



(12) 



In Fig. [2] we compare the values of as from Eq. (12) 
with numerical estimates derived by putting in Eq. ( 10 ) 
the maximum modularity Q2, derived with simulated an- 
nealing. The calculation of Q2 is carried out for different 
values of A, but the results seem to be essentially inde- 
pendent of A. We consider both Erdos-Renyi (ER) and 
scale-free (SF) graphs, with 1000 vertices and average 
degree (k) = 20 (left panel) and 10 (right panel). The 
SF graphs have degree exponent 2. As we can see from 
Fig. [2j the analytical estimate of Eq. ( [l2| ) yields a good 
approximation of as • 

Let us now consider our splitting-merging problem, 
considering A and B as candidates. We set n = 1, which 
means that only two links come out of S (ideally one 
from A, the other from B). In this case, we would like to 
have A > 0, to avoid the split of the random subgraph 
S. From Eq. ^ and Eqs. (11) we get (remember that 



D. Condition on the ineliminability of the bias 



2a s M s > 



X(M S + l) 2 
M 



which implies 



A< 



2a s M 
M s ' 



(13) 



(14) 



Alternatively, we can incorporate the correction factor 
[Ms /(Ms + l)] 2 ~ 1 in as, so that we call as what is 
actually as[Ms/(Ms -f l)] 2 . If the subgraph is a clique, 
as ~ 0.5, and modularity can even split a clique when 



M s 



C. Merging clusters 



(15) 



Let us now consider two equal sized subgraphs con- 
nected with one edge (v = 1 and n = 1) and let 
ktn — kf n = ic- Eq. m becomes: 



A 



fe + 2) 2 
M 



(16) 



In this case we want A < (we wish to keep the two 
subgraphs separated), which implies 



A > A c 



2M 



(£c + 2) 2 



(17) 



If & is very small, A has to be very big (for Xc > 1 the 
subgraphs cannot be resolved by standard modularity, 
which corresponds to A = 1, and we recover the resolution 
limit of Ref. [15]). On the other hand if & is large, the 
subgraphs will be resolved for a large range of A- values. 
If the subgraphs are two cliques of nc nodes each, for 
instance, £c — nc(nc — 1). 



We now put together conditions (14) and (17) 
have that 



A 2 < A < Ai 



where 



Ai 



2a s M 
M s 



and A2 



2M 



(&+2)2 



We 



(18) 



(19) 



Above Ai, modularity splits random subgraphs, below A2 
it puts together subgraphs even if they are connected by 
just one link (even in the case in which they are cliques). 
In the range between Ai and A2 it should be possible to 
avoid both biases. However, if 

Ai < A 2 , (20) 

the biases cannot be both simultaneously lifted. Eq. 
holds when, by setting Ms /as = As, 



(£c+2) 2 <As. 



(21) 



Note that Eq. (I21| does not depend on the size of the 
whole network, either in terms of vertices or edges. 

To be more concrete we consider a simple example. We 
examine a network made out of two identical cliques of nc 
vertices each and an internally random subgraph of ns 
vertices and average degree (k) s- The three clusters are 
all connected to each other by one edge only (see Fig. [3| . 
In Fig.[4]we plot the relation between nc and ns coming 




Figure 3: (Color online) Schematic network with two cliques 
and a random subgraph, which are the natural communities 
of the network. 



from the equality Ai = A2 (obtained turning the inequal- 
ity of Eq. pl| ) to an equality) for some values of (k)$- 
We used Eq. ( [l2|) to evaluate a^, with the approxima- 
tion (vk)s = \/(k)s an d the relations & = nc(nc — 1) 
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Figure 4: (Color online) This plot shows Eq. (21) as a func- 
tion of ns and nc for the simple network with three clusters 
of Fig. [3] Above the curves modularity cannot find the right 
partition for any value of A. 



and Ms = ns(k)sj2. For any given value of (k)s, the in- 
equality of Eq. (21 ) holds above the corresponding curve. 

In Fig. [5] we plot Ai and A2 as a function of n<s, for 
nc = 13 and (k)s = 100. For Ai we show two curves, one 
corresponding to the exact function, determined numer- 
ically, while for the other we have used the theoretical 
approximation of as described above. The lines divide 
the X — ns plane in four areas, characterized by the pres- 
ence or absence of the two biases. As we can see, the 
portion of the plane in which both biases are simultane- 
ously absent (gray area) is quite small. 

One might still wonder that it could be possible to find 
a value of A high enough that the random subgraph S is 
split in ns vertices and the two cliques are still correctly 
detected. Let us consider Eq. 5 when A consists of a 
single vertex, so that v is the internal degree of the vertex 
with respect to B and I + v = k A is the total degree of 



A. Recalling that kf n + r + v = kf ot , Eq. 5 becomes: 
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2v-X- 



h 13 h 



M 



(22) 



Therefore A and B would be kept separated when: 



A> 



2Mv 

UB h,A ' 



(23) 



By increasing A we can actually separate some vertices 
of S and we would eventually split it in ns clusters when 
A > — , where x is the minimum fokj over all the con- 
nected vertices (i, j) of S. Similarly, the condition for the 
cliques not to be split reads: 



A< 



2M 



(n c -l)(n c -2y 



(24) 



since the denominator is the total degree of a clique of 
nc — 1 vertices (we neglected r) and we considered k A = v 
(the vertex does not have external connections). 

In conclusion, if there are two connected vertices in 
S such that the product of their degrees is smaller than 
(nc — l)(nc — 2), no values of A are suitable to guess 
the right answer (s). This is very likely to happen if the 
degree distribution of S is broad, so that there are many 
low-degree vertices. 
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Figure 5: (Color online) Threshold parameters Ai and A2 as 
a function of ns (nc = 13, (k)s — 100). The theoretical 
line for Ai is obtained by approximating as as described in 
the text. We see that Ai > A2, up to ns ~ 230, so that 
no A can eliminate the biases for bigger values of ns • When 
ns <~ 230, the biases can be both eliminated only in the 
shadowed area between the curves. 



III. TESTS ON BENCHMARK GRAPHS 

We want now to check the practical consequences of the 
limits of multiresolution modularity. For that we take the 
LFR benchmark, a model of graphs with built-in commu- 
nity structure that we have recently introduced [24] . It is 
an extension of the planted £-partition model introduced 
by Condon and Karp [25]. Each graph has power law 
distributions of degree and community size, which are 
common features of real graphs with community struc- 
ture. The degree of mixture between clusters is measured 
by the mixing parameter /i, expressing the ratio between 
the number of neighbors of a vertex outside its commu- 
nity and the total number of neighbors. So \i = in- 
dicates that clusters are topologically disconnected from 
each other, as each vertex has neighbors within its com- 
munity only, while /i = 1 indicates that vertices are con- 
nected only to vertices outside their group, so the groups 
are not communities. Vertices are linked to each other at 
random, compatible with the constraints on the distribu- 



tions of degree and community size and to the fact that 
fi has to be (approximately) the same for all vertices. So 
the clusters are essentially random subgraphs. 

We want to specialize Eq. ([5| to the LFR benchmark 
graphs. Let us consider a cluster S with n^ nodes, total 
degree 2ra6 and internal degree 2Msb- We split it into 
two equal-sized subgraphs such that the internal degree 
of either part is the same: kf n = kf n . Moreover, for 
simplicity we assume that the split is done such to keep 
an equal number of edges between each of the subgraphs 
and the rest of the network: / = r. We have M$b = 
(1 - {i)m b , I = r = [im b , v = a S bM S b = a<sb(l - n)m h . 
The condition of non-splitting is: 



2v > A 



(M Sb + If 
M 



which is: 



So, 



2asb(l - fj)m b > A— ^. 



A < Ai where Ai = 2asb(^ — aO 



M 



(25) 



(26) 



(27) 



We now search for the condition that leads to the merger 
of two clusters of an LFR benchmark graph. For that we 
should know how many edges they share, which depends 
on the graph size and the number of clusters. We call 
the number of edges between modules x and y and 
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2m x and 2m y their total degrees. Eq. ([5| becomes 



2v x 



4m x m y 



(28) 



The condition to keep the clusters separated is A > A2, 
where 



A 2 



MVxy 

2m x m y ' 



(29) 



So, the two biases can be simultaneously removed iff Ai > 
A2, which amounts to 



2a Sh (l-fjb) — > y -. 

m^ 2m x m y 



(30) 



The inequality of Eq. (30) has to hold for all triples of 
clusters x, y and 6, and this is usually unlikely to hap- 
pen. In order to show that, we check whether multireso- 
lution modularity is able to deliver the planted partition 
of the LFR benchmark graphs for any value of the res- 
olution parameter A. The results are shown in Figs. [6] 
and [7| We plot the fraction of vertices which are incor- 
rectly classified by modularity as a function of A. We just 
consider misclassifications caused by merging (circles) or 
splitting (squares) the clusters of the planted partition 
of the graphs. We see that, for small values of A, mod- 
ularity merges many clusters and essentially splits none, 
whereas for large A there is a dominance of splitting over 



merging. The plots clearly show that, for every value 
of A, there will be some misclassification due to cluster 
merging, splitting or both. The fraction of affected ver- 
tices does not go below 10% but it can be considerably 
larger. Fig. prefers to graphs with 10000 vertices, but the 
situation does not improve if we go to larger graph sizes 
(50000 vertices for the benchmark graphs used for Fig. [7]). 
We point out that we have chosen low values of the mix- 
ing parameter ji (0.1 and 0.3), corresponding to clusters 
which are well separated from each other. Modern algo- 
rithms for community detection (like Infomap [26] and 
OSLOM [27]) would easily find the correct partitions in 
the graphs we have used for the tests of Figs. [6] and [7] 
(see Ref. [H]). One may object that our estimate of 
the modularity maximum for each graph is just an ap- 
proximation of the actual result, whose search is an NP- 
complete problem [28]. However, we have checked in each 
case that the partitions found have a higher modularity 
than the planted partition of the benchmark graphs. 
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Figure 6: (Color online) Test of multiresolution modularity 
on LFR benchmark graphs. Each panel shows the fraction 
of misclassined vertices due to artificial mergers (circles) and 
splits (squares) of clusters, as a function of the resolution 
parameter A. The panels correspond to different choices of 
the exponent T2 of the cluster size distribution of the graph 
and of the mixing parameter fi. Each point represents an 
average over 100 benchmark graphs. All graphs have 10000 
vertices. The other parameters are: average degree (k) = 20; 
maximum degree k ma x = 100; minimum cluster size Cmin = 
10; maximum cluster size c macc = 1000; degree exponent n = 
2. 



Finally we would like to check how general our results 
are. We have focused on the multiresolution modularity 
proposed by Reichardt and Bornholdt in Ref. [18]. In 
this paper, however, the authors had proposed a general 
ansatz for the quality function, and their multiresolution 
modularity was just a specific case of it. In a recent 
work [29 , Traag et al. have shown that this ansatz can 
be specialized to include other known measures, like the 
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Figure 7: (Color online) Same as Fig. [6] but for LFR bench- 
mark graphs of 50000 vertices. All other parameters are the 
same as for the graphs used in Fig. [6] 



Fig. [9j in which we take LFR benchmark graphs with 
the same parameters as those used for Fig. [6| Now we 
have 10000 vertices and cluster sizes vary from 10 to 1000 
vertices. Again, for the multiresolution methods we use 
the values of the resolution parameters that give the best 
results. The figure shows that the multiresolution meth- 
ods fail to detect the planted partition even for very low 
values of the mixing parameter /i, especially when the 
cluster size distribution is broader (j2 = —2). This is 
consistent with the results of Figs. [6] and [7| Infomap and 
OSLOM, on the other hand, have a clearly better perfor- 
mance, despite the fact that they do not have a tunable 
resolution parameter. In particular, Infomap always de- 
tects the right partition, for the range of \i explored here. 
Most networks of current interest have many more than 
10000 vertices, and accordingly community sizes span 
much broader ranges of values. Fig. [9] suggests that in 
such cases the performance of multiresolution methods 
might become far worse. 



multiresolution modularity by Arenas et al. [19], and the 
quality function adopted by Ronhovde and Nussinov [30] , 
which is characterized by not having a null model term, 
in contrast to modularity. In fact, Traag et al. derived 
another model from the general class of functions of Re- 
ichardt and Bornholdt, which they called Constant Potts 
Model (CPM), which allegedly has no resolution limit. In 
Fig. [8] we reproduce the results of the comparative anal- 
ysis performed by Traag et al. on the LFR benchmark. 
Here we compare five methods: Infomap, OSLOM, the 
optimization of the multiresolution modularities of Re- 
ichardt and Bornholdt (RB) and Arenas et al. (AFG), 
and the CPM by Traag et al.. For each selected value of 
the mixing parameter \i we generated 100 realizations of 
the LFR benchmark, and averaged on them the values 
of the similarity between the detected and the planted 
partition. As similarity measure we took the Normal- 
ized Mutual Information (NMI) [31], which has become 
a standard in this kind of evaluations. In our compu- 
tations we used a modified version of the measure [32 , 
recently introduced by the authors of this paper, that is 
able to estimate the similarity of partitions as well as the 
similarity of covers, i.e., of divisions of a network into 
overlapping communities. We have used this version of 
the NMI in our comparative analysis of community de- 
tection algorithms [17], so we stick to it for consistency. 
We stress however that the clusters of the graphs we con- 
sidered are not overlapping. 

As found in Ref. [29] , it is possible to find values of the 
resolution parameter for RB and CPM, that make these 
methods outperform both Infomap and OSLOM. This 
holds for AFG as well, whose performance is essentially 
identical as RB. However, this is due to the fact that the 
cluster sizes are too close to each other, spanning less 
than one order of magnitude. This is demonstrated by 
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Figure 8: (Color online) Comparative analysis of several mul- 
tiresolution techniques on the LFR benchmark. The graphs 
are made of 1000 and 5000 vertices, the exponent of the degree 
distribution n = 2, the exponent of the clusters size distri- 
bution T2 = 1, the average degree (k) = 20, the maximum 
degree k ma x = 50, the cluster size ranges are S = [10, 50] and 
B =[20: 100]. 



IV. CONCLUSIONS 

We have shown that multiresolution modularity max- 
imization is characterized by two concurrent biases: the 
tendency to merge small clusters and to split large ones. 
We have seen that it is usually very difficult, and often 
impossible, to tune the resolution such to avoid both bi- 
ases simultaneously. Tests on artificial benchmark graphs 
with community structure indeed show that a consider- 
able fraction of vertices is misclassified, for any value of 
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Figure 9: (Color online) Comparative analysis of several mul- 
tiresolution techniques on the LFR benchmark. The network 
parameters are now the same as for the graphs used in Fig. [6] 
In particular, the network size is 10000 and the cluster size 
spans two orders of magnitude. The two panels correspond 
to T2 = 2 (left) and T2 = 3 (right). 



the resolution parameter, even when clusters are well sep- 
arated and easily identified by other methods. Since, in 
practical applications, one knows very little about the 
community structure of the graphs at study, it is impos- 
sible a priori to quantify the systematic error induced 
by the use of modularity. Moreover, it is not easy to 
implement a way to "heal" the partition delivered by 
modularity, just because there are two sources of errors. 
If modularity simply combined smaller clusters in larger 
ones, as people have been thinking until now, one could 
hope to recover the real partition by looking inside the 
clusters delivered by modularity. Instead, since clusters 
can be both split and merged, the real partition must 
be recovered by splitting some clusters and merging oth- 
ers, and it is very difficult to understand which clusters 
contain smaller ones and which others are parts of larger 
clusters instead. This would require a careful exploration 
of groups of clusters. 

Our results hold for various types of quality functions, 
including the recently introduced Constant Potts Model 
by Traag et al. [29]. One could argue that, after all, mul- 
tiresolution methods have a remarkable performance in 
some cases (see Fig. [8| and a poor one in others (see 
Fig. |9|, just like any method, including Infomap and 
OSLOM (from the same figures). This objection is how- 
ever not sustainable, since we believe that, when clusters 
are so weakly connected to each other that one could even 



distinguish them by visual inspection, a good method 
cannot fail to detect them. While this is a shared view 
among scholars, it is still unclear where to set the limit of 
fuzziness between subgraphs that separates a regime in 
which they are clusters from one in which they are not. 
This problem has attracted some attention lately [33|l34] . 
So, in the tests we reported (Figs. [8] and [9]) it is not clear 
up to which value of the mixing parameter \i the sub- 
graphs of the benchmark graphs are still "significant" 
clusters, beyond random fluctuations. But there is no 
doubt that they are cluster for very small values of the 
mixing parameter \i. 

We want to stress here that we are not advocating 
the superiority of some methods over others. The prob- 
lems that we point out in this paper are probably com- 
mon to many other methods. Infomap itself, for in- 
stance, is a method based on the optimization of a global 
measure, like modularity, and is likely to have a resolu- 
tion limit as well, although it probably emerges only on 
large networks. In addition, it may also break random 
subgraphs, although its performance is perfect for well 
separated communities in all tests we have performed. 
OSLOM could be also improved, since it occasionally fails 
to detect the right partition for small \i. Still, at vari- 
ance with mult iresolut ion methods, neither Infomap nor 
OSLOM have a tunable resolution parameter, so their 
performance is quite remarkable. 

We conjecture that the tendency to simultaneously 
merge and split clusters is an inevitable feature of meth- 
ods based on global optimization, and that it could be 
more easily circumvented by local approaches. Global 
optimization techniques work well when clusters are ap- 
proximately of the same size; if clusters span a broad 
range of sizes, which is likely to happen on very large 
networks, such techniques get confused and may fail to 
detect some of the clusters, even when they are clearly 
identifiable. Resolution parameters improve things, but 
they do not (cannot?) solve the problem. 

We hope that the scientific community working on the 
problem of community detection will address this issue in 
the future, and that general structural limits of classes of 
methods will be identified and, possibly, removed. In this 
way it will be possible to define safe guidelines to design 
new methods that do not suffer from such problems and 
that therefore could be more reliable in practical appli- 
cations. 



Acknowledgments 

We gratefully acknowledge ICTeCollective, 
238597 of the European Commission. 



grant 



[1] M. Girvan and M. E. Newman, Proc. Natl. Acad. Sci. 
USA 99, 7821 (2002). 



[2] S. Fortunato, Physics Reports 486, 75 (2010). 

[3] R. Albert and A.-L. Barabasi, Rev. Mod. Phys. 74, 47 



(2002). 
[4] S. N. Dorogovtsev and J. F. F. Mendes, Adv. Phys. 51, [20 

1079 (2002). 
[5] M. E. J. Newman, SIAM Rev. 45, 167 (2003). [21 

[6] R. Pastor- S at orr as and A. Vespignani, Evolution and 

Structure of the Internet: A Statistical Physics Ap- [22 

proach (Cambridge University Press, New York, NY, 

USA, 2004). [23 

[7] S. Boccaletti, V. Latora, Y. Moreno, M. Chavez, and 

D. U. Hwang, Phys. Rep. 424, 175 (2006). [24 

[8] G. Caldarelli, Scale- free networks (Oxford University 

Press, Oxford, UK, 2007). [25 

[9] A. Barrat, M. Barthelemy, and A. Vespignani, Dynami- 
cal processes on complex networks (Cambridge University [26 

Press, Cambridge, UK, 2008). 
[10] R. Cohen and S. Havlin, Complex Networks: Structure, [27 

Robustness and Function (Cambridge University Press, 

Cambridge, UK, 2010). [28 

[11] M. E. J. Newman and M. Girvan, Phys. Rev. E 69, 

026113 (2004). 
[12] M. E. J. Newman, Proc. Natl. Acad. Sci. USA 103, 8577 

(2006). [29; 

[13] R. Guimera, M. Sales-Pardo, and L. A. Amaral, Phys. 

Rev. E 70, 025101 (R) (2004). [30 

[14] J. Reichardt and S. Bornholdt, Physica D 224, 20 (2006). 
[15] S. Fortunato and M. Barthelemy, Proc. Natl. Acad. Sci. [31 

USA 104, 36 (2007). 
[16] B. H. Good, Y.-A. de Montjoye, and A. Clauset, Phys. [32 

Rev. E 81, 046106 (2010). 
[17] A. Lancichinetti and S. Fortunato, Phys. Rev. E 80, [33 

056117 (2009). 
[18] J. Reichardt and S. Bornholdt, Phys. Rev. E 74, 016110 [34 

(2006). 
[19] A. Arenas, A. Fernandez, and S. Gomez, New J. Phys. 



10, 053039 (2008). 

A. Clauset, M. E. J. Newman, and C. Moore, Phys. Rev. 
E 70, 066111 (2004). 

G. Palla, I. Derenyi, I. Farkas, and T. Vicsek, Nature 
435, 814 (2005). 

F. Radicchi, C. Castellano, F. Cecconi, V. Loreto, and 
D. Parisi, Proc. Natl. Acad. Sci. USA 101, 2658 (2004). 
J. Reichardt and S. Bornholdt, Phys. Rev. E 76, 015102 
(R) (2007). 

A. Lancichinetti, S. Fortunato, and F. Radicchi, Phys. 
Rev. E 78, 046110 (2008). 

A. Condon and R. M. Karp, Random Struct. Algor. 18, 
116 (2001). 

M. Rosvall and C. T. Bergstrom, Proc. Natl. Acad. Sci. 
USA 105, 1118 (2008). 

A. Lancichinetti, F. Radicchi, J. J. Ramasco, and S. For- 
tunato, PLoS ONE 6, el8961 (2011). 
U. Brandes, D. Delling, M. Gaertler, R. Gorke, 
M. Hoefer, Z. Nikolski, and D. Wagner (2006), URL 
http : //digbib . ubka . uni-karlsruhe . de/volltexte/ 
documents/3255 

V. A. Traag, P. Van Dooren, and Y. Nesterov, Phys. Rev. 
E 84, 016114 (2011). 

P. Ronhovde and Z. Nussinov, Phys. Rev. E 81, 046114 
(2010). 

L. Danon, A. Diaz-Guilera, J. Duch, and A. 
Stat. Mech. P09008 (2005). 
A. Lancichinetti, S. Fortunato, 

11, 033015 (2009). 

G. Bianconi, P. Pin, and M. Marsili, Proc. Natl. Acad. 
Sci. USA 106, 11433 (2009). 

A. Lancichinetti, F. Radicchi, and J. J. Ramasco, Phys. 
Rev. E 81, 046110 (2010). 



Arenas, J. 
J. Kertesz, New J. Phys. 



